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Abstract 

Let F2 be the finite field of cardinality 2 n . For all large n, any subset A C F£ x F£ of cardinality 

v log log n 



log n 



must contain three points {(x, y) , (x + d, y) , (x, y + d)} for x, y, d G F£ and d / 0. Our argument is an 
elaboration of an argument of Shkredov [14], building upon the finite field analog of Ben Green [10]. The 
interest in our result is in the exponent on log n, which is larger than has been obtained previously. 

1 Main Theorem 

We are interested in extensions of Roth's theorem on arithmetic progressions in dense sets 
of integers to a two-dimensional, finite- field setting. Specifically, for a finite group G define 
the quantity rz{G) to be the cardinality of the largest subset of G x G containing no corner. 
A corner is triple of points of the form {(x, y), (x + d, y), (x, y + d)} with d ^ 0. 

While this concept is most interesting in the context of the groups G = Z^, it already 
makes sense-and is substantial-in the context of finite fields. In this paper, we only consider 
the case of G = Here and throughout this paper we write N = 2 n = jF^I- 

Theorem 1.1. r z (F") < N 2]o^f^L, 

y log log JV 

This bound is an improvement, in the setting of F£ , of the bounds provided by Shkredov 
[14, 15], and as simplified by Ben Green [10, 11]. The main point is that we elaborate on 
the 'Density Increment' procedure, obtaining a density increment on a set which is the 
intersection of sublattices in two distinct sets of bases. 

Our theorem is an example of the quantitative bounds on questions of arithmetic combi- 
natorics. We refer the reader to the papers of Gowers [6], and surveys by T. Tao [17] and 
Ben Green [8, 10] for more history. 

Erdos and Graham raised the question of quantitative bounds for rz, and this question 
was raised again by Gowers [6]. Ajtai and Szemeredi [1] first proved that rz(Z N ) = o(N 2 ). 
Furstenberg and Katznelson [4, 5] gave a far reaching extension, though their method of proof 
does not in and of itself permit explicit bounds. Solymosi [16], and V. Vu [18] provided such 
bounds, although of a weak nature. 
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I. Shkredov [13, 14] provided the first 'reasonable' bounds. We are using his ingenious 
argument, as explained and simplified by Ben Green [10] in the finite field setting. In 
particular Green showed that one could achieve an estimate in which r^(N)/N 2 decreased 
like (loglogiV)~ c where c could be taken to be 1/21. We find an additional extension of this 
argument, and sharpen some inequalities to obtain our Theorem. 

We comment that 'the finite field thesis' holds that questions of this type should first be 
studied in the context of finite fields. This is because one can implement many of the tools 
of analysis, e. g. convolution and Fourier transform, in that setting. In addition, one has the 
powerful concept of being able to pass to appropriate affme subspaces. Moreover, there are 
a range of methods that one can use to 'lift' the finite field argument to TLfq. See papers by 
Bourgain [2], Green and Tao [9] and Shkredov [15] for more information. 

Acknowledgment. One of us (M.L.) was supported in part by the NSF and the Guggenheim 
Foundation. One of us (W.M.) was supported in part by a VIGRE grant awarded to Georgia 
Institute of Technology. Both authors are grateful for the hospitality of the University of 
British Columbia. While there, we benefited greatly from conversations with Izabella Laba 
and Jozef Solymosi. Ben Green and I. Shkredov pointed out an error in an earlier version of 
this paper. We also thank the referee for additional corrections, which lead to strengthening 
of the main result. 

1.1 Outline of Proof 

Ben Green has provided a comprehensive outline of the method of proof [10], so we will be 
somewhat brief. 

Let i c Fj x Fj. It is natural to count the expected number of corners that A has. This 
expectation will be approximately what it should be if the 'Box norms' of A are small. These 
norms, we use three of them, are explicitly given in (2.8) — (2.10), and are a two dimensional 
analog of the 'combinatorial square' norms that play such a prominent role in the proof of 
Roth's Theorem [9,10,12]. It is therefore a certain measure of 'uniformity' 

An important difference between our approach and the previous ones is that we emphasize 
the role of three distinct coordinates in the problem. Two of these are the obvious X and 
Y coordinates, given by the canonical basis ei and e2 respectively. The third is the diagonal 
coordinate D, given by ei +e2- In this way, our argument resembles the ergodic theoretic 
arguments, and in particular that of Conze and Lesigne [3] who describe the characteristic 
factor for the ergodic averages 1 

N 

71=1 

where T 1 ,T 2 are commuting measure preserving transformations. 

Two of the three box norms employ this diagonal coordinate. Either being large is an 
'obstruction to uniformity' (See [17].) But, in contrast to the case of Roth's theorem, this 
obstruction to uniformity has no clear arithmetic consequence. It does imply, however, that 
the set A has an increased density on a sublattice. That is, there is a subsets X,Y,Dc W%, 
of relatively large density, for which A has a larger density on the intersection of X xY and 

1 The argument for two commuting transformations in this paper is complete. We arc grateful to Bryna Kra for pointing 
out this reference to us and to Emmaunel Lesigne for providing us with a copy of this paper. 
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diag 

X x D, where in the latter product, we are taking the product in the coordinate system 
(ei,ei + 62). See Lemma 3.4. Forming the intersection of two sublattices in this way is the 
main new contribution of this paper. 

It was a significant insight of Shkredov [14] that (1) one could, after an additional argu- 
ment, assume that X, Y and D had an arithmetic structure, namely that it was uniform in 
the sense of (2.2), see Lemma 3.8; and (2) the Box norm could still be used as an 'obstruction 
to uniformity' if X, Y and D were uniform. This is the content of the Lemma 3.1. 

If there is an obstruction to uniformity, an increment in the density of A can be found. 
One can find the obstruction to uniformity only a finite number of times, else the density of 
A would exceed one. Thus at some point, there would be no obstruction to uniformity, and 
so A would have a corner. All of the details of the proof are below. 

2 Preliminaries and Definitions 

We will let H C denote a subspace. The dimension of this subspace will decrease in the 
course of the proof. X, D,Y C H denote subsets. We adopt the notations of probability and 
expectation with respect to the counting measure on H . We view X as a subset of the first 
coordinate associated to basis element ei; Y as a subset of the second coordinate, associated 
to basis element e 2 ; and Dasa subset of the 'diagonal' coordinate associated to ei + e 2 . We 
will be working with subsets of 

S = X xYDX x D. (2.1) 

It is to be emphasized that the products are taken in the first instance in the ei and e 2 
coordinates; the second in the ei and ei + e 2 coordinates. 
Write the density of X in H as 

5 x :=nX\H)=\§ {) 

and similarly for 5y and Sd. In the iterative procedure used in this proof, these densities 
will be decreasing, somewhat rapidly. Throughout this paper we will not only be concerned 
with the density of the subsets X and D, but also with how 'uniformly distributed' they are 
in H. A quantification of this quality comes in the following definition: 

||X|| Uni :=sup™. (2.2) 

This definition only makes sense relative to a subspace H. If ||X||uni < V then we say that X 
is 77 -uniform. Here X represents the the Fourier transform of X which is defined as follows, 
in which uo = — 1 is a second root of unity: 

£(£): »(•'•)— 

It is immediate from the definition that a translate of a uniform set by an element of H 
is again uniform. After the deletion of a small subset, a uniform set is again uniform. Let 
E C X, we have 

||X-£|| Uni < \\X\\ VBi + F{E\X). (2.3) 
This proposition will be used repeatedly. 
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Proposition 2.4. Let X d H and denote the density of X in H by 8 X . Then 

[E deH \E yGH X(d-y)G(y)-5 x E yeH G(y)\ 2 ] 1/2 < Helium [E„G(y) 2 ] 1/2 (2.5) 
/or any function G. 

Notice that we are comparing a convolution to it's zero Fourier mode. The Proposition 
follows from Plancherel, and the definition of uniformity. A form of this inequality that we 
will use several times is this: For any two sets A,BdH, 

E xeX \E eY A(x + y)B(y)-¥(A)F(B)\ 2 < min{||A|| [/>(£) , \\B\\H^(A)} . (2.6) 

That is, only uniformity in one coordinate is required. 

Now, S is as in (2.1), and let Ad S. Write the density of A as 

5 := F(A | S) . 

This quantity will increase in the iterative procedure used in the proof. We define the 
balanced function of A to be the function supported on S as 

f(x,y) := A(x,y) - 5S . 

Our standing assumption will be 

\\X\\ VBi , ||F||u„i , PHuni <v, v = (55 x 5 Y 5 D f (2.7) 

where C is a large constant which we need not specify exactly, as its precise value only 
influences implied constants in our main Theorem. In the proofs of Lemmas, we will use the 
notation v 1 for a fixed, but unimportant, function of v, that tends to zero as v does. 
Further, for a function / : S — > C, define the following norm 

\\f\h = S D i E x ^ eX f(x,y)f(x',y)f(x,y')f(x',y') (2.8) 
y,y'& 

where we use the standard basis (ex, 62). This norm averages 'cross correlations' of / over all 
boxes in X x Y. When / is the balanced function of A, the norm being 'large' is an obstacle 
to A having the correct number of corners. 

We use two additional norms. In the (ex, ex + 62) coordinate system, 

WfWhx ■= S Y ^ x , x >exf(x,d)f(x',d)f(x 1 d')f(x',d') . (2.9) 

d,d'eD 

Similarly, with respect to the (e2,ex + 62) coordinate system, define 

II/HlV := S x %, y >eYf(y, d)f(y\ d)f(y, d')f(y' : d') . (2.10) 

d,d'eD 

The leading normalizations in these definitions are initially confusing, but chosen so that 
the norms are essentially bounded from above by the L°° norm of /. The point of these 
next propositions is that the quantities introduced above behave as they should, under the 
assumption of uniformity. In particular, (2.16) and (2.17) justify the normalizations in the 
definition of the Box norms. 
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Proposition 2.11. Let X,Y, D C H be as above, and let S be as in (2.1). Assuming (2.7) 
we have 

E xeX S = 6 D + 0(v'); (2.12) 

yeY 

E xeX S = 5 Y + 0(v'); (2.13) 

deD 

E deD S = S x + 0(v'); (2.14) 

yeY 

E xeX S{x, y)S(x + s, y)S{x, y + s) = S X S Y 5 2 D + 0(v'); (2.15) 
seH 

E xyeX S(x, d)S(x', d)S(x, d')S(x\ dt) = 5 Y + 0(v'); (2.16) 

d,d'eD 

E y>y , eY S(y, d)S(y', d)S(y, d')S(y\ d') = 5% + 0(v'). (2.17) 

d,d'eD 

In (2.16) we are using the (ei,ei + e2) coordinate systems, while in the (2.17) we are using 
the (e2,ei + e2) coordinate system. 

Proof. For (2.12), observe that 

S x 5 Y E xe xS = E XiyeH X(x)Y(y)D(x + y) = 5 X 5 Y 5 D + 0{S^% /2 v) . 

yeY 

This equality only requires uniformity in one coordinate. See (2.6). The equalities (2.13) 
and (2.14) are corollaries, after a change of variables. 

To see (2.15), we apply Lemma 2.19, with f = X and g — Y. Using the notation $ in 
(2.20), we have 

E x ,y tSeH S(x, y)S(x + s, y)S(x, y + s) 

= E x , y>seH X(x)X(x + s)D{x + y)D{s)Y{y)Y{y + s) 
= E x , yeH X(x)Y(y)D(x + y)$(x + y) + 0(v') 
= S x 6 Y E X;yeH D(x + + y) + 0(v') 

= 5 x 6 Y E xeH D(x)<f>{x)+0(v') 

It remains to estimate the last expectation, which we view as an inner product. Observe 
that 8(0) = \H\5 D 5 X 5 Y . And, by Plancherel, 

\E xeH D(x)<f>(x)-5 2 D 5 x 5 Y \ = \H 



^5(a)$(a 

< || J D|| Uni |i/|- 1 ^|8( 

< v\H\~ 2 5 D ^2\X{a)Y{a)\ 

< VOdO x Oy ■ 

Concerning (2.16), we use a similar proof to the one above. We follow the notation in 
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Lemma 2.19, and its proof. Set 



x') = E d Y(x + d)D(d)Y(x' + d) 



Lemma 2.19 implies that \& is well approximated by $. Hence, we can estimate 



5 2 X 5 2 D ■ (2.16) = E X)X > eH X(x)X(x')D(d)D(d')Y(x + d)Y{x' + d)Y(x + d')Y(x' + d') 

d,d'eH 

= E XtXl&H X(x)X(x')^(x,x'Y 

= E x>x , eH X(x)X(x')$(x + x') 2 + 0(v') 

= 6 2 x E xeH $(x + x') 2 + 0(v') 



«i#r 4 E ? («) 4 + ^) 



= 5 2 x 5 2 D 5*+0(v'). 

Here, we have used Lemma 2.19, uniformity in X, Plancherel, and uniformity in Y. The 
second equality (2.17) follows from the first. This completes the proof. 

□ 

Remark 2.18. There is a second way to see (2.15), which we only briefly indicate, since the 

diag 

method of proof is not self contained. Consider S = X xY f] X x Dasa subset of X x Y , 
and let A be it's balanced function. Namely A = S — ¥(S \ X x Y)X x Y. One can then 
define the Box norm, as does Shkredov 

H A llRcctBox = f ^x,x'ex&(x,y)A(x',y)A(x,y')A(x',y') 

It follows from the proof of (2.16), that we have ||A|| RectB ox ^ v'. Shkredov [1 I] showed that 
under this assumption, and uniformity in X and Y, that the set S has nearly the expected 
number of point in it. That is the content of his 'Generalized von Neumann Lemma.' 



Lemma 2.19. Let D be uniform, and let f,g be two functions on H. Define 



^ = Tfk E /(^"V" • (2-20) 



We have the inequality 



|2 



E Xiy£H \EJ(x + s)D(s)g(y + s) - $(x + y) 

<\\D\\ Vm [E x f{x) 2 ] 112 -[Eygiy) 2 ] 112 

Proof. Consider 

V(x, y) = EJ(x + s)D(s)g{y + s) 



1/2 



(2.21) 
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as a function on H x H . Expanding / in dual variable a and g in dual variable (3 we have 
V{x,y) = \H\~ 2 f{a)g{Pp a - x+l3 - y E s D(s)u {a+l3 > s 



This shows that iff (a, (3) = f(a)'g((3)\H\~ 1 D(a + (3). Clearly, $ of the Lemma consists of 
the reconstruction of iff from those Fourier coefficients (ct,f3) for which a + (3 = 0. And by 
Plancherel, the Lemma follows from 

Ma,f3)\>< |p||L£lftW)l a 

= \H\*-\\D\\ 2 Jni -E x f(x) 2 -E y g(y) 2 . 

□ 



3 Primary Lemmata 

Our first lemma is a generalized von Neumann estimate, a term coined by Green and Tao 
[7,9, 10]. It gives us sufficient conditions from which to conclude that A has a corner. 

Lemma 3.1 (Generalized von Neumann). Suppose that A C S with F(A | S) = 5; (2.7) 
holds; and we have the two inequalities 

5 X 5 Y 5 D 5 2 N > C , (3.2) 
max{||/|| n , ||/|| D ,x, \\f\\n,y}<^ 4 . (3.3) 

Then A has a corner. 

Here, and throughout this paper, C represents a large absolute constant. The exact value 
of C does not impact the qualitative nature of our estimate, so we do not seek to specify 
an optimal value for it. Also < c, «, k' < 1 are small fixed constants, which plays a role 
similar to C. 

Our second lemma tells us that if the conditions of our first lemma are not satisfied then 
we can find a sublattice on which A has increased density. 

Lemma 3.4 (Density Increment). For < k there is a constant < k! < 1 for which the 

diag 

following holds. Suppose that AcS = XxYf]X x D with ¥(A \ S) — 5, that f is the 
balanced function of A on S, and that 

max{||/|| D , ||/|| D)X , ||/||n,y}>^ 5/4 . 

Then there exists I'd, Y' cY , D' C D such that three conditions hold. 

either X' = X, or Y' = Y, or D' = D; (3.5) 

diag 

W(A \S')>6 + k'5 2 , S' = X' x Y' n X' x D' ; (3.6) 
¥{X' | X) , P(y' | Y) , ¥(D' | D) > k'5 2 . (3.7) 



7 



We need only refine two of the three sets X , Y and D above. Note that with uniformity 

diag 

in coordinate that is not refined, we then have that the set S — X' x Y n X' x D has 
about the expected number of points in it. 

Our third lemma is a modification of one from a note of Ben Green [10]. It tells us that 
we can find a uniform sublattice on which A has increased density which is important since 
it is a required premise in applying the generalized von Neumann Lemma. 

Lemma 3.8 (Uniformizing a Sublattice). Suppose that X,Y,D are as above. In addition 

1. X, Y, and D satisfy (2.7); 

2. X' CIJ'C Y, D' C D, with F(X' | X) > c5 2 and similarly for Y and D; 

3. Either X' = X, Y' = Y or D' = D; 

diag 

I S' = X' x Y' n X' x D'; 

5. F(A \S') = 5 + c5 2 ; 

6. dim(#) > C[5 4 (v") 2 ]-\ where < v" < 1 is fixed. 

Then there exists X" C X' , Y" C Y , D" C D' and H', H" , translates of the same subspace 
Ho < H , so that 

H^luni, Helium, Whm<v'\ (3.9) 

diag 

P(^4 | S") > 5+ f <5 2 , S = X" x Y fi X" x D" , (3.10) 

dim(Fo) > dim(H) - C[5 4 (v") 2 }^ , (3.11) 

¥(X" | H 1 ) > k5 2 ¥(X' I H) . (3.12) 

An inequality similar to the last one also holds for Y" and D" . In particular F(D" \ H' + 
H") > k5 2 F(D' I H) 

It is to be emphasized that H' and H" are translates of the same subspace of H < H, 
therefore H 1 + H" is also a translate of H . Thus, after a joint translation of A, X, Y and 
D, we can assume that H' and H" are in fact the same subspace H . It is this translation 
that is used in the iteration of the proof. 

4 Proof of Theorem 1.1 

Combining Lemmas 3.1 through 3.8 of the previous section yields the proof of Theorem 1.1. 
Since the proof is by recursion, we describe the conditional loop needed for the proof. 

Proof. Initialize X <- F» Y <- F™, D <- F' 2 \ S <- F™ xFJ, 1 , # <- F™. Likewise <J X , 5 Y , ^ 1. 
Fix a set A with density S in F 2 x F 2 . Initialize A <— A and 5 <— P(A | 5). Now we will 
iteratively apply the following steps: 

1. If max{||/|| D , ||/|| n> x, H/lhy} > apply Lemma 3.4. 
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2. If X', Y' or D' is not v = (55x>o~y'3d) c uniform, apply Lemma 3.8. Suppose these 
sets are as in the Lemma: subsets X" C X', Y" C Y', D" C D' and affine subspaces 
H', H" C H containing X", Y" , D" . After joint translation of X", Y", D", A and H', H", 
we can assume that H' = H" and are subspaces of H. 

3. Update variables: 

X<-X n , Y^Y", D^D", H<-H', 
5 X <- F(X" | H') , 5 Y <- P(y" | H') , 5 D *- F(D" \ H') , 

diag 

S^XxYHXxD, S^P{A\S). 

4. Observe that the density of the incremented A on the set S has increased by at least 
k,5q. Also, the incremented densities, 5x, Sy, and Sd have decreased by no more than 

Once this loop stops, Lemma 3.1 applies — provided that the initial dimension is large 
enough — and we conclude that A has a corner. This iteration must stop in < iterates, 
else the density of A on the sublattice would exceed one. Thus we need to be able to apply 
Lemmas 3.4 and 3.8 < Sq 1 times. In order to do that, both X and H must be sufficiently 
large at each stage of the loop. 

This requirement places several lower bounds on N = 2 n . The most stringent of these 
comes from the loss of dimensions in (3.11). Note that before the loop terminates, we can 
have Sx as small as 

Sx > (kSo)^- 1 . 
In order to apply Lemma 3.8 at that stage, we need 

N > 2 (cs y cs o\ 

From this condition we get the bound stated in the Theorem. □ 

5 Proof of Generalized von Neumann Lemma 

Proof of Lemma 3. 1 . Define 

T(J, g, h) = E x , S:yeH f(x, y)g(x + s, y)h(x, y + s). (5.1) 

Thus, T(A, A, A) is the expected number of corners in A. We show that this quantity is at 
least a fixed small multiple of 5^5y5|)5 3 . By assumption (3.2), C5 x S Y Sl,5 3 N 3 > 55xSySdN 2 . 
The left hand side is the expected number of corners in A, while the right is the number of 
trivial corners in A — that is the number of points in A. Thus A is seen to have a corner. 

Throughout the proof, it is convenient to make the substitution s — > x + y + s in the 
expression for T(/, g, h), thus 

T(/, g, h) = E Xi y !S f(x, y)g(y + s, y)h(x, x + s) . 

We are of course using the fact that we work in a field of characteristic two, but there is a 
similar substitution for any field. 



We make the substitution A = f + 8S to get 



T(A, A, A) = 5 3 T(S, S, S) (5.2) 

+ 5 2 T(f, S, S) + 5 2 T(S, f, S) + 5 2 T(S, S, f) (5.3) 

+ 5 T(f, f,S) + 6 T(S, f,f) + 5 T(f, S, f) (5.4) 

+ T(/,/,/). (5.5) 



We have grouped the terms according to the number of /'s that appear. 

The main term is the right hand side of (5.2). Using (2.15), we see that S 3 T(S, S, S) > 
l^^y^l)^ 3 . (Observe the difference in the normalizations on the expectations in (2.15).) 

All three terms in (5.3) are approximately zero, but we have to use uniformity to see this. 
For instance, we appeal to (2.5) and (2.7) to see that 

5 2 T(S, /, S) = 5 2 E x ^ H X(x)Y(x + s)D(x + y)f(y + s, y) 

= 6 2 5 x E x , y , seH Y(x + s)D(x + y)f(y + s,y) + 0(v') 
= 5 2 5 x 5 Y 5 D E ytSeH f(y, s) + 0(v') 
= 0(v') . 

In this line and below, v' is an unimportant function of v which tends to zero. 

The three terms in (5.4) are all controlled by appeal to Lemma 5.8. For instance, we have 
using the assumption about the maximal size of box norms (3.3) 

\6T(f, SJ)\<v' + 5(5 x 5 Y 5 D ) 2 \\f\\ n \\f\\ n , x < k5 7 / 2 (5 x 5 y 5 d ) 2 . 

The other two terms admit a similar bound. 

We bound the final term T(/, /, /), with the inequality Lemma 5.8: 

|T(/, f,f)\<v' + 5 1 ' 2 {5 x 5y5 D ) 2 \\f\\ ntX \\f\\ niY < k(5 x 5 y 5 d ) 2 5 3 . (5.7) 

Using the hypothesis on the Box norm, (3.3), and the equation (5.6) will prove the Lemma. 

□ 

Lemma 5.8. For fj e {/, S} we have the estimate 

{ll/olb ■ ||/i||d,y • ll/2||n,x 
ll/olln-ll/ilkyll/alb (5-9) 
||/o||n-||/i|| 2 -||/ 2 ||n,x 

Here, ||/|| 2 = 5 1 ' 2 while \\S\\ 2 = l. 

Proof. We prove an instance of the claimed inequalities: 

|T(/o,/i,/ 2 )| <v'+\\fM5 x 8 Y 8 D f\\h\\u,Y-\\f2\\u,x. (5.10) 

By a change of basis, this inequality implies the other two. 
Apply Cauchy Schwartz once, in the variables (x,y), to get 

|T(/o,/i,/ a )|<(E !B , w /o(x ) y) a ) 1/2 .U 1 / 2 
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The first term on the right is no more than Wfo^i^xSySD) 1 ^ 2 • As for the second term, it is 

U = E x , y D(x + y)\E s f 1 (y + s,y)f 2 {x,x + s)\ 2 

= E yiSjS ,{E x D(x + y)fi(y + s, y)h{y + s', y)} ■ {f 2 (x, x + s)f 2 (x, x + s')} 

Note that in the definition of U, we have inserted the term D(x + y), which arises from 
A(x, y). 2 We apply Proposition 2.4 to replace D(x + y) by 5d and then use Cauchy Schwartz 
again in the variables x, x' to get 

U = MUi-U a ) 1/2 + 0(t/); 

U x d ^ E yyeH \E seH f 1 (y+ s,y)h{y' + s,y')\ 2 = S x S Y S 2 D \\fi\\h,Y^ 

U 2 = E X7X , eH \E seH f 2 (x,x + s)f 2 (x',x' + s)\ 2 = S 2 x 5 Y 5 2 D \\f 2 \\ 4 n>x . 

A change of variables permits the identification of U\ and U 2 . This completes the proof of 
(5.7). 

□ 



6 Proof of Density Increment Lemma 

Proof of Lemma 3.4- We prove this assertion: Fix c < 1. There is a constant k so that 
the following holds. Assume that ||/||n > c<5 5 / 4 , and show that there are subsets X' C X, 
F'cYso that 

F(X' | X) , P(Y" | Y) > k5 2 , 

diag 

¥(A | X' x Y' n X' x D) > 5 + k5 2 . 

We emphasize that the box norm we use is the one given in (2.8). And we will not refine the 
diagonal coordinate. This is one instance of the Lemma, which by a change of coordinates, 
this will prove the Lemma as stated. 

We can assume that the fibers above points x G X, and y G Y, typically behave as 
expected. Namely, we assume that 

E xex \E yeY f(x,y)\ 2 <kW d , 
E yeY \E xeX f(x,y)\ 2 <k 3 6 2 6 2 d . 

for otherwise, we can apply Lemma 6.25 to conclude the Lemma. 

For a point (x, y) G A, consider N x = f {y' \ (x, y') G A}, N y = f {x' \ (x', y) G A}. These 
are the neighbors of x and of y, respectively. We need to consider points for which these sets 
are about as big as they should be. Set 

X" = {x G X | ¥(N X \Y) > k5 5 5 d , \E yeY f(x,y)\ < k65 d } , 
Y" = {y G Y | P(AyX) > k5 5 5 d , \E xeX f(x,y)\ < k55 d } . 

It is clear that these sets are most of X and Y respectively. In particular, in view of (6.1) 
we have 

F(X" | X) , F(Y" \Y)>l-5 2 . (6.2) 

2 Without this term, we would not get the right power of <5q in our estimates. 
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Clearly, we can assume that 

®*ex»f(x,y) <k5 2 S d (6.3) 
y&" 

for otherwise we already proved the Lemma. But, it is also the case that we can assume 

-K6 4 5 D <E xeX „f(x,y) (6.4) 

for if this inequality fails, we apply Lemma 6.26 to conclude the proof of the Lemma. 
We further note that we have 

® x , x >ex»f(x, y)f(x', y)f(x, y')f(x', y') > . (6.5) 
y,y'&" 

If we were taking the expectation over X and Y, this would follow from the assumption 
that ||/||n > cS 5 ^ 4 . As we are taking the expectation over X" and Y", we need to show 
that taking the expectation over the complement of X" we get an appropriate upper bound, 
namely 

\E x , x , eX l x _ x „(x)f(x,y)f{x',y)f(x,y')f{x',y')\ < v' + kS 5 S 4 d . (6.6) 
y,y'€Y" 

Three similar inequalities hold, so using the assumption that ||/||n > c<5 5 / 4 , taking < k < y 
we will see that (6.5) holds. 

To see (6.6), first observe that by definition of X", 

E x>x < eX l X - X „(x)N x (y)D(x + y')D(x' + y)D(x> + y') < v' + ^H A D . (6.7) 
y,v'€Y" 

This does not prove (6.6) since f(x,y) is not supported on N x (y). But, we also have the 
similar inequality 

E,y e x lx-x»(x)D(x + y)N x (y')D(x' + y)D(x' + y')v' + ^H A D . (6.8) 
y,y'eY" 

Of course / = SS — A, so we can estimate 

5\^x, x >ex l X -x"{x)D(x + y)f(x, y')f(x', y)f(x', y') \ 
y,y'eY" 

<v' + k5H 4 d + 5 2 \E XtX , £X l X - X „(x)D(x + y)D(x + y')f{x', y)f(x', y') \ 
y,y'eY" 

<v' + k5 5 5 4 d + 8 2 S 2 D \E x y £X l x _ x „(x)f{x', y)f{x', y')\ 

y,y'eY" 

<v' + k5H 4 d + 6 4 6 2 D E x/eX \E yeY »f(x',y)\ 2 
<v' + k6 5 5 4 d + k5 6 5 4 d . 

This completes the proof of (6.6). 

To find the subset on which A has increased density, we consider any subset of the form 

diag 

N y x N x n Ny x D, where y G Y" and x G X" . Clearly we are interested in the largest 
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increase in density, for which we estimate 



t An{N y xN x nS"}\ Q"(A,A,A,A) 
SUP \N y xN x n S"\ ~ QT (A, A, A, S) ' l °- yj 



X&X",y& 
{x,y)eA 



s" = x" x y" n x" d x g D , 



Q"(foJxj2j3) = 1& x , x >ex»fo(x,y)fi(x',y)f 2 (x,y')f 3 (x',y'). 

y,y"eY" 

Here, we note that for (x, y) G X" x Y" we have \A fl {N y x i\r x fl S"}\ > 0, so that we are 
not dividing by zero in (6.9). By definition, we have 

¥(N y x N x | X x Y) > k 2 5 w 5 2 D . 

And by uniformity, we have 

F(N y xN x n S") = E xl ^ H N y (x')N x (y')D(x' + y') 

>v' + k 2 5 10 5 3 D 5xS Y > . 

In the last inequality, v' is a function of the uniformity constant. 

There is a gain of regularity in passing to the expectations on the right hand side of (6.9). 
Expand A = f + 5S in the last place in Q(A, A, A, A) to see that 

Q"(A,A,A,A) Q"(A,A,A,f) 
Q"{A,A,A,S) ~ Q"{A,A,A,S) 

and so we should show that the last fraction is at least c'5 2 , which we do by showing that 

Qi\A,A,A,f) > c'8 5 Sj ) , (6.10) 
< Q"(A,A,A,S) < 105 3 5j). (6.11) 

This we will do, assuming one more condition. If this last condition fails, we will get a 
density increment of 8 2 . 
Define 

a 2 x = 85 2 D E xe x"\E y eY"f(x,y)\ 2 , a 2 Y = 55 2 D E yeY "\E xeX "f(x,y)\ 2 , 

By (6.1) and the definitions of X" and Y", one can see that these two quantities are at most 
kiJ 3 ^, and hence are only a small fraction of the major term in the considerations below. 
Using A — SS + f, Q"(A, A, A, S) has the expansion 

Q"{A, A, A, S) = S 3 E x>x , eX »D(x + y)D{x + y')D{x' + y)D(x' + y') (6.12) 
y,y'ev" 

+ 35 2 E x y £X „f(x, y)D{x + y')D{x' + y)D{x' + y') (6.13) 
y,y'£Y" 

+ 6E x , xleX „f(x, y)f(x, y')D(x' + y)D(x' + y') (6.14) 
y,y'eY" 

+ 5E XiXleX „D(x + y)f(x, y')f{x', y)D{x' + y') (6.15) 
y,y'eY" 

+ SE x>x 'ex»f(x, y)D(x + y')f(x', y)D{x' + y') (6.16) 

y,y>&» 

+ E xyex »/(x, y)D(x' + y') . (6.17) 

y,y'&" 
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Clearly, the uniformity in D is relevant. The right hand side of (6.12) is S 3 5j ) plus a term 
controlled by uniformity; the term in (6.13) is, by (6.4), at least —3k5 6 , plus a term controlled 
by uniformity; (6.14) is a x , plus a term controlled by uniformity; (6.15), by (6.3) and (6.4), 
obeys the inequality 

(6.15) <v' + 55 2 D \E xeXjyeY f(x,y)\ 2 < v' + k 2 6 3 5 a d ; 

(6.16) is approximately in ay] while the last term (6.17) is not one that admits an obvious 
control, and we write 

(6.17) — v' + A, A d ^ f 5 D E x y eX »f(x,y)f(x,y')f(x',y). (6.18) 

y,y'eY" 

We can assume that |A| < K,S 3 Sj), otherwise we apply Lemma 6.31 to finish the proof of the 
Lemma, getting a density increment of the order of 6 2 . This proves (6.11). 
The expression Q(A, A, A, f) admits a very similar expansion. 

Q"(A, A, A, f) = 5 3 E x y eX „D(x + y)D(x + y')D(x' + y)f(x', y') (6.19) 

+ Q 2 + Q 3 (6-20) 

+ E XtX , eX „f(x, y)f(x, y')f(x', y)f(x', y') , (6.21) 
y,y'€Y" 

Q 2 = 6 2 {E x y eX »f(x,y)D(x + y')D(x' + y)f(x',y') (6.22) 
y,y'&" 

+ E xyeX „D(x + y)f(x, y')D{x' + y)f(x', y') 
y,y'&Y" 

+ E x y eX „D(x + y)D(x + y')f(x', y)f{x\ y') } , 
y,y'&" 

Q 3 = 5{E x y €X »D(x + y)f(x, y')f(x', y)f(x', y') (6.23) 
y,y'eY" 

+ E x>x , ex „f(x, y)D{x + y')f(x', y)f(x', y') 
y,y'&" 

+ E XtX , eX „D{x + y)f(x, y')f(x', y)f(x', y')} . 
y,y'&" 

The right hand side of (6.19) is greater than — n5 3 5%, plus a term controlled by uniformity 
by (6.3) and (6.4); the term in (6.21) is the box norm, which by (6.5) is at least c'S 5 6%; here 
of course, c' > is fixed in advance, and we take k much smaller than c'; the terms which 
make up the definition of Q 2 all involve two /'s, and after taking uniformity into account 
each individual term is positive, and so can be ignored as we are obtaining a lower bound 
for Q(A, A, A, f); the three terms in the definition of Q 3 are all of the form v' + A where 
A is defined in (6.18). In particular, we have already assumed |A| < n5 3 5p. For < n 
sufficiently small, this proves (6.10), and so the proof of this Lemma. 

□ 

We use the following simple variant of the Paley Zygmund inequality. It states, in par- 
ticular, that a random variable, bounded in L°° norm by one, with mean zero, and standard 
deviation a, must be at least a constant multiple of a on a set of probability proportional to 
the variance a 2 . 
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Proposition 6.24. Let 1 < p < oo. Then there is a c > so that for all 1 < p < oo, and 

random variables Z with — 1 < Z < 1, EZ = 0, and ~E\Z\ P = a p . Then, F(Z > ca p ) > c p a p . 

Proof. In fact, we can take c = |. We assume that the conclusion is false and seek a 
contradiction. Since = we have 

— EZl{z <0 } = EZ1{^ >0 } 

caP} 

< 2ca p . 

With this, and the fact that p > 1 while Z is bounded by 1, we can now estimate 

a p = E\Z\ P = E\Z\ p l {z<0} + E\Z\ p l {z>0} 

< 2EZ1 {Z>0} < Aca p . 

This is a contradiction. 

Lemma 6.25. If it is the case that 

E xeX \E yeY f(x,y)\ 2 >c5 2 5 2 D 
then there is a set I'd for which ¥(X' \ X) > jk5 2 on which we have 



□ 



(liaid 



F(A | X' x Y n X' x" D) > 5 + ^5 2 

Proof. The set of x G X for which 

25 D < \E yeY f(x,y)\ < \E yeY D(x + y)\ 

has probability that is controlled by uniformity, and hence is negligible. Thus, the Lemma 
follows immediately from the Paley Zygmund inequality. □ 

diag 

Lemma 6.26. Suppose that there is a sublattice S = X" x Y H X" x D with 

F(X" | X) , P(y" | Y) > 1 - A , (6.27) 
P(^4 | S") <8-r. (6.28) 

diag 

Then, there is a sublattice S' = X' x Y' PI X' x D on which 

F(X' | X) , V(Y' \ Y)>\t, (6.29) 
F(A \S')>5 + ktX- 1 . (6.30) 
Proof. Notice that the density of A on S — S" must be strictly larger than 5. Namely, 

<5[1 - F(S" | S)} + tF(S" I S) 



F(A | S - S") > 



> 5 



1 - F(S" | S) 
F(S" | 5) 



1 - F(S" | S) 
> 5 + ktA" 1 . 

But, clearly 5 — 5"' is a union of three sublattices, and on one of these three, A must have 
density at least 5 + ktA" 1 . We must have F(X" \ X) < 1 — |r, otherwise we contradict 
(6.28). This finishes the proof. □ 
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Lemma 6.31. Fix c > 0. If it is the case that 

\E x J(x,y)E x ,f(x' J y)E y ,f(x,y')\ > c6 3 5 3 D (6.32) 
Then, there is a constant d = c'(c), I'd, Y' cY with 

P(X' | X) , P(y' | Y) > c'5 2 , (6.33) 

diag 

F(A \X' xY'nX' x D) > 5 + c'5 2 . (6.34) 

Proof. By uniformity in D, we have 

P(|E,/(x,y)| >25 D ) <v'. 

That is, the effective L°° bound on E y f(x,y) is 25e>. 
Holders inequality and the assumption (6.32) implies 

c5 3 5 3 D < [E^|/(a;,y)| 3 ] 1/3 - [E x , y D(x + y)\EJ( X ,y)\ 3/2 ■ \E x f(x,y)\ 3/2 } 2/3 
< 25 1 / 3 5 1 I { 3 [E x , y D(x + y)\E y/ f(x,y')\ 3/2 ■ \E x ,f(x', y) | 3/2 ] 2/3 
<v' + 25 l ' 3 5 D [E x \E y ,f{x, y')f 2 ■ E y \E x ,f(x', y)\ 3/2 ] 2/3 . 

Here it is essential that we insert the term D(x + y) when we apply Holders inequality. Note 
that uniformity in D is then used to obtain the full power of 5d in the last line. 
Thus, we must have e. g. 

E y \E x f(x,y)\ i/2 >c"5 2 5f 
Then, the conclusion of our Lemma follows from the Paley Zygmund inequality. 

□ 



7 Proof of Lemma 3.8 

We include a proof of this Lemma as we are requiring a more than is claimed in e. g. Ben 
Green's survey [10]. In particular, we claim that all three sets X, Y, D can be uniformized. 

Let us indicate the central way that uniformity is used in this proof. See [10, Lemma 
3.4(1)]. 

Proposition 7.1. For any subset X C H , there is a partition of H into two affine subspaces 
H' and H" for which 

i{P(X | H') 2 + F(X | H") 2 } > P(X | H) 2 + l\\X\\ 2 Vni . (7.2) 

The following more technical Lemma describes a key inductive procedure in the proof. 

Lemma 7.3. Let < t, u < 1 be positive parameters. Let U C H. Suppose that dim(iJ) > 
10(*u 2 ) -1 . Then, there is a partition V of H , so that writing V = U U J\f , 

1. All H' eV have dimension dim(if') > dim(if) — 4(tn 2 )~ 1 ; 

2. ||£/||uni < u for allH' e U; 
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3. Forbu = F(U \ H), 

P(|J{#' I H' e Af} | H) < tSu. 

Remark 7.4. We will not use this Lemma as stated, but rather the more complicated variant 
that follows. We include this statement and proof for clarity's sake. Some of the notation 
above is taken from Ben Green's note [10]. In particular, U is for 'uniform', and Af is for 
'non-uniform.' 

Proof. The proof is an inductive procedure, though we will not define the collections of 
hyperspaces U and Af until the conclusion of the iteration. Initialize variables 

G <-{#}■ 

Also initialize a counter m <— 0. Given Q, define 1Z to be those H' £ Q for which \\U R 
#'||uni > u. WHILE 

PflJ{#' I H' e K}) > t5 v 

update m <— m + 1. And for each if' £ 1Z apply Lemma 7.3 to H' . Thus, H' = H[U H' 2 for 
which 

\{F(U | H[f + F(U | H' 2 ) 2 } > F(U | i/') 2 + X • (7-5) 

Update Q-1ZU{H[,H' 2 \ H' ell}. When each £ K has been so split, set V m <- Q. 
The WHILE loop then repeats. 

Once the WHILE loop stops, return the value of m, the sequence of partitions Vi, . . . , V m , 
and Q = V m . Define 

U = {H' £ Q | ||Xn#'||uni < «}, 

and Af == Q — U. The subspaces in Q have dimension at least dim(if ) — m. 

Once the WHILE loop stops, it is clear that the conclusions (2) and (3) of the Lemma 
hold. We concern ourselves with the first conclusion about the dimension of the subspaces 
involved. Observe that at each iterate of the WHILE loop, the dimensions of the partition can 
only lose one dimension. It suffices to estimate the number of iterates of the Loop, which is 
the counter m. 

Consider the quantities 

Sj = P ( f/ I H ') 2 H H ' | H) = E E(U | Vjf , 

which are the mean square densities of U relative to the partitions Vj. (Here we are relying 
on the usual notations for conditional second moments.) Obviously these quantities are less 
than 5jj = F(U \ H). Each H' which is split in (7.5), the mean square density in H' is 
increased by \u 2 . And, this takes place, at each iterate of the loop, in a portion of the 
whole space H that is at least probability t5jj. From this, we see that Sj + \ > Sj + \t8uu 2 . 
Consequently, mtSuu 2 < S m < Sjj. This proves (1), and so the proof is complete. 

□ 

The more technical statement that we need is as follows. Whereas in the first Lemma, 
we have a single subset U C H and construct a 'good' partition, in this statement we have 
three subsets Ui,U 2 ,U 3 C H and construct a single 'good' partition in the product space 
H x H. 
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Lemma 7.6. Let < t, u < 1 be positive parameters. Let U\,U 2 ,U 3 C H. Suppose that 
dim(H) > 10(tu 2 ) _1 . Then, there is a partition V of H x H, so that writing V = U U A/i U 
M 2 U A/3, we /iave the following. 

1. For all Vi x V 2 G 

dim(Vi) = dim(^ 2 ) > dim(#) - 4(tM 2 )" 1 . 

Moreover, V\,V 2 as affine subspaces of H , are translates of each other. 
3. For all V x x V 2 G ZY, max^^H^ n V^||uni < u awd ||^3 H (Vi + V 2 )|| Un i < 

P(U^ I ^1 x ^2 G A/}} I #) < % j = 1,2, 
PflJ{^i + V 2 | Vi x V 2 G A/" 3 } I i?) < tcV 3 . 

Remark 7.7. If we did not insist on the second conclusion, that the relevant subspaces 
be translates of one another, we could simply take a product of partitions arising from 
Lemma 7.3. Due to the prominent role of the diagonals in our problem, this is an essential 
point for us. 

Proof. We describe the recursive procedure. Initialize variables 

Q <- {H x H}. 

and three counters rrij <— 0, j = 1, 2, 3. The recursive procedure we describe will return the 
partition we need, as well as some auxiliary data that we need to prove the Lemma. 
Given Q, define TZj, j = 1, 2, 3 by 

Uj = {Vi x V 2 G Q I \\Uj n Vj Uni > u] , j = 1,2 

K 3 = {v l xV 2 eQ\ \\u 3 n (Vi + v 2 )|| Uni > u} • 

For j = 1,2 

IF PflJOS I Vi x V 2 G 7^}) > % (7.8) 
THEN update <- rrij + 1. WHILE 7^- ^ 

• Apply Lemma 7.3 to Vj. Thus, V,- = V- U V," for which 

|{p(t/,- 1 v,') 2 +P(t^ I v") 2 } > nuj I v 3 f + Wj n ^IIL- (7-9) 

• (This point is a departure from the previous proof.) Let k be the other index. 3 Since 

is a translate of Vj, we can choose translates V£ and V fc " of V^' which also partition 

V k . 

• At this point, let us observe that we will have V{+V 2 , VZ+Vf, V"+V^ V/'+Vf C Vi+V 2 . 
And indeed, all of these subspaces are affine translates of V/, hence these four subspaces 
are made of two equal pairs, which partition Vi + V 2 . 

3 That is, if j = 1, then k = 2. Notice that we are enforcing a partition on Vj. that does not necessarily have anything to do 
with increasing conditional variances. 
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• Update 

q <- (Q - {Vi x v 2 }) u {v{ x vi, v{' x v£, v; x u 2 ", u/' x u 2 "} 

<- n 3 - {V! x v 2 }. 

When the WHILE loop stops, define Vm 1 +m 2 +m- i = Q and aj(rrij) = mi + m 2 + m 3 . 

We now describe the procedure as applied to the set U 3 . That the products in Q are 
products of translates of the same subspace plays a critical role in this formulation. In 
particular, for V\ x U 2 £ ^ V\ + V2 is a translate of Vi (and V2). 

IF ¥(\J{V 1 + V 2 \V 1 xV 2 eTZ 3 })>t5 Us (7.10) 

THEN update m 3 <— m 3 + 1. For each Vi xV 2 E 1Z 3 

• Apply Lemma 7.3 to W = V x + V 2 . Thus, W = W U W for which 

|{P(£/ 3 1 W) 2 + nu 3 1 iy") 2 } > P(t/ 3 1 ^) 2 + ||c/ 3 n . (7.11) 

• Since the spaces are translates of W, we can choose translates VJ and V^" of W which 
also partition Vj, j = 1,2.. 

• Update 

Q (Q - {K x U 2 }) U {U/ x U 2 ', U/' x U 2 ', V{ x U 2 ", U/' x U 2 "} 

n 3 <- n 3 - {Ux x u 2 }. 

Repeat these steps until TZ 3 is exhausted. Then, define V mi+m2+m3 = f Q and a 3 (rrij) = 
mi + ^2 + "i3- 

Iteratively apply the three conditionals, two in (7.8) and one in (7.10). STOP when 
all three conditionals fail. Return the values of rrij, the sequence of partitions of Vj for 
1 < j < mi + m 2 , the 'partition times' {<Jj{n) \ 1 < n < rnj} and collection Q. 

Define 

Mj = {Vi x U 2 e Q I n Vallum > u} , j = 1,2 

and similarly define A/3. Set U = Q — Ni — J\f 2 — A/3. 

All subspaces chosen in this way have dimension at least equal to dim(H) — mi — m 2 — m 3 . 
We need only provide upper bounds on the rrij, as all the other claims of the Lemma are 
evident from the construction. 

We claim that rrij < (Stu 2 ) -1 , and for this, we can use the previous proof, with one 
additional fact. Let ir be a finite partition of a probability space, and suppose that tt' refines 
7i. Then, for any random variable Z we have 

E(E(Z I tt) 2 ) < E(E(Z I tt') 2 ) 

Here, we are using a standard notation for conditional expectation given tt. This is a simple 
martingale fact. Indeed, let Y = E(Z | tt') — E(Z | 7r), and observe that E Y ■ E(Z | 7r) = 0. 
The random variable in question is Uj. Set 

Sj >n = E E(C/j I V n ) 2 , 1 < n < m = f mi + m 2 + m 3 . 
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We have just seen that the Sj n are increasing in 1 < n < m. They are obviously at most 
5u r And by construction, and in particular using (7.5) and (7.11), at each time at which the 
corresponding conditional is satisfied, we increase these numbers by a definite amount. At 
each iterate of the loop, in a portion of the whole space H that is at least probability tS^, 
where this uniformity constant is at least u. From this, we see that 

Suj > > s j,^(e-i) + tu 2 5 Vj , 1 < i < rrij , j = 1, 2, 3 . 

Therefore, rrij < (tu 2 )~ l . The proof is complete. □ 

Proof of Lemma 3.8. Let us assume that e.g.D' — D. Apply Lemma 7.6 to the sets X' = U±, 
Y' = U 2 , D = D' = U 3 , with t = f^5 2 and u = v" . Let V be the partition of H x H that 
this Lemma gives us. 

Define two subsets E\ and £ 2 of V by 

E x d ^ f {vi x v 2 e V I P(x' n Vj | Vj) < t} . 

We define E 2 similarly, with X' replaced by Y', and £3 with X' replaced by D. These are 
the 'empty' portions of the partition which nearly avoid X' or Y' entirely. 
Consider 

x = \J{x' nv 1 \v 1 xV 2 eSi uM} 

and similarly define Y . Then by construction, F(X PI X') < 2t = |<5 2 . Observe that 
uniformity in D then implies that 

E XtyeH X (x)D(x + y)Y\y) = v' + P(X | H)8 D 8 Y , . 

That is, we can assume 

diag diag 

P(X xY'nXo x D I X' x Y' n X' x D) < ~5 . 

The same inequality holds with Xq replaced by X and Y by Yq. The import of this is the 
inequality 

¥(A \X 1 xY 1 f]X 1 d x g D) > 5 + f 5 2 , X l = X - X , Y x d = Y - Y . 

Let V\ X V 2 G P — 5i — £ 2 — M\ — M 2 . In particular, X fl V\ is a uniform set, obeying 
||X fl Vi||uni < v " ■ As a consequence, we have 

E xeVl ,yev 3 X(x)Vi(x)D(x + y)Y(y)V 2 (y) = v + F(X \ Vx)¥(D \ V x + V 2 )¥(Y \ V 2 ) . 

Here, v is a function of i>" that need not concern us. Set 

D Q = {DnV 1 + V 2 \V 1 xV 2 e£ 3 U A3}. 

It follows that 

diag diag 

P(Xi xY 1 nX 1 x D \X 1 xY 1 nX 1 x D)< \b 2 . 
Then, for D\ = D — Dq, we have 

diag _ 

¥(A \X 1 xY 1 HX 1 x D 1 )>5 + f S 2 . 
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From this, we see that we must have some H' x H" G U for which 

diag r o 

F(A | H' x H" nlix^nl! x D 1 )>S+ -S 2 . 

We take X" =X 1 nV L , Y" = Y l nV 2 , D" =D 1 nH' + H" . These are all uniform subsets, 
satisfying (3.9); the second conclusion (3.10) is the inequality above; the lower bound on 
the dimension of H' and H" follows from Lemma 7.6; and the final conclusion follows from 
the fact that the element of the partition that we chose, H' x H" is not in the collection 
^lU^U £ 3 . 

□ 
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